Calibration of microphone arrays for improved speech recognition
نویسندگان
چکیده
We present a new microphone array calibration algorithm specifically designed for speech recognition. Currently, microphone-array-based speech recognition is performed in two independent stages: array processing, and then recognition. Array processing algorithms designed for speech enhancement are used to process the waveforms before recognition. These systems make the assumption that the best array processing methods will result in the best recognition performance. However, recognition systems interpret a set of features extracted from the speech waveform, not the waveform itself. In our calibration method, the filter parameters of a filter-and-sum array processing scheme are optimized to maximize the likelihood of the recognition features extracted from the resulting output signal. By incorporating the speech recognition system into the design of the array processing algorithm, we are able to achieve improvements in word error rate of up to 37% over conventional array processing methods on both simulated and actual microphone array data.
منابع مشابه
شکلدهی وفقی و هوشمند پرتو در آرایههای میکروفونی Ad-hoc با استفاده از خوشهبندی و رتبهبندی میکروفونها
Considering the existence of a many speech degradation factors, speech enhancement has become an important topic in the field of speech processing. Beamforming is one of the well-known methods for improving the speech quality that is conventionally applied using regular (classical) microphone arrays. Due to the restrictions in the regular arrangement of microphones, in recent years there has be...
متن کاملEnabling Speech Applications using Ad Hoc Microphone Arrays
Microphone arrays are central players in hands-free speech interface applications. The main duty of a microphone array is capturing distant-talking speech with high quality. A microphone array can acquire the desired speech signals selectively by leading the beampattern towards the desired speaker. The foreseen application of ubiquitous sensing motivated by the abundance of microphone-embedded ...
متن کاملPhone-based filter parameter optimization of filter and sum robust speech recognition using likelihood maximization
Because of noise and reverberation, accuracy of speech recognition systems decreases when the distance between talker and microphone increases. By the using of microphone arrays and appropriate filtering of received signals, the accuracy of recognizer can be increased. Many different methods for using microphone arrays have been proposed that can be classified into two main approaches: systems ...
متن کاملMicrophone array speech recognition: experiments on overlapping speech in meetings
This paper investigates the use of microphone arrays to acquire and recognise speech in meetings. Meetings pose several interesting problems for speech processing, as they consist of multiple competing speakers within a small space, typically around a table. Due to their ability to provide hands-free acquisition and directional discrimination, microphone arrays present a potential alternative t...
متن کاملMultiple regression of log-spectra for in-car speech recognition
This paper describes a new multi-channel method of noisy speech recognition, which estimates the log spectrum of speech at a closetalking microphone based on the multiple regression of the log spectra (MRLS) of noisy signals captured by distributed microphones. The advantages of the proposed method are as follows: 1) The method does not require a sensitive geometric layout, calibration of the s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001